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(57) The invention provides a process for producing 
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by growing plant cells having the genetic information re- 
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cultivatingaplantorapart thereof comprising such plant 
cells, characterised in that said plant cells are grown, or 
said plant or a part thereof, is cultivated in the presence 
of a trehalase inhibitor. 
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Description 

HELD OF THE INVENTION 



10 



tr.ha.ose by „ etelase . The inv.n.ion Lhe, ZZ^S^S^^L £ a V '* iMI " 9 <* 
•aaich a .aw« suc h memods, contain reSfeW^Zl,^' ? ? ""5 "WW™*, ana pans Iheraol. 



STATE OF THE ART 



copyranoside is a widespread na^cS^^^ 

apart ,rcm a few exceptions, such as ^a^Zfe^,^^^ " ^ ,OUnd in p,a " ts - 

»W* Apart from these species, trehalose is l^^rS^^STff and Myrothamnus flabel- 

mas), wherein it is synthesized by bacteroids- the trehaSe SnrZ^lT Legum ' nosae (Spermatophytae, Angiosper- 
from these accidental occurrences, p ten spec 2 betoSa toT.T VT " dmSh9 M ° the root cells A Pa« 
and/or accumulate trehalose. P belonging to the Spermatoprtyra apparently lack the ability to produce 

In International patent application WO 95/01446 filed on lima -vi i OQ /- 
a method is described for providing plants not naiuTal v canlto, J 6 name °' MOGEN '"National "V 

In spite of the absence of trena'ose as 'T^lST™? 9 "" h the Capac ^ t0 do s °- 

degrading activity has been reported for a conside^bTe n mbTof hSfer n , ° CCUrrenCe ° f ,reha,ose " 

trehalose. The responsible actrvrty could be attnbTed" ^renale^^ SPe ° ,eS ' ' nC,Udin9 ,hOSe knw " n ,0 ,aCk 

to be generally more sensitive to the adverse Sectfof reha o!l t h , P '° W tfehalaSe tevels we ' e ^ 

activity. Trehalose-analogs, such as trehalose lr w?' than p,ants exhibitin 9 * higher level of trehalase 

sib,e to study the effects of trehalose ZES^^S^J?? Sh °° ,S ' makin 9 » P~ 

were adversely affected by the addition oUr^X2Z^Kr -.T high amounfs of treha '^ 

and suspension culture of various ^^^7^ 

29. 2525-2582. spe^ae using Val.damyc.n is disclosed by Kendall etai. 1 990, Phytochemistry 

It is an object of the present invention to proves p,ants and p.ant parts capable of producing and accumu.ating 



SUMMARY OF THE INVENTION 



» n^ 

or a part thereof comprising such plant cells charac tarit^n ST ^^f 3 ' 056 and trehalase, or cultivating a plant 
thereof, is cuitfvated in thepresencf 2 Vl ^2^ZTp^^tl *? ^ °' P ^ " a part 
genehcally altered so astocontainachimerictrehalose^nh^!!!? P P ' am partS ° r plant ce,ls have b ^ 

to one embodiment said trehalose ph^^SS^ItZ!^ 9608 ' P ' ant eXpr6SSib ' e ,0rm Accordin ° 
Phosphate synthase from E co//in plant exoresStol C ° mP an open readi "9 frame encoding trehalose 

prefe^yTce^r t TuT^^sTo FT * *»" *° ** * ^ 
reading frame encoding lreha|ose ^0^^^^"^^ T^ 9 '° ^ embodimen « *• °pen 
provide for preferential expression o, th gene TSS a^mb^ubetTfr °' ^ paWh pr ° moter ' to 
According to another asoect of th* inlntTJ,, 1 , m,cr o-tubers of Solanum tuberosum. 
Accord,^ anot J p a ^ m for exam P>e hydrocuiture. 

uptake by said P^c^pr*^^ 

mM, in aqueous solution. oetween 100 nM and 10 mM. preferably between 0.1 and 1 

a ^^.X^trrSe 0 *" ^ tomM " y -■->« «• - — . 9.na ,o 
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transformed with DNA coding for said protein. 

The invention further provides plants and plant parts which accumulate Irehalose in an amount above 0.01 /<• 
(fresh weight), preferably of a Sofenaceaespecies, in particular Solanum tuberosumor Nicotiana tabacum, in particular 
a micro-tuber of Solanum tuberosum containing trehalose. ... 

5 The invention also comprises the use of a plant, or plant part, according to the invention for extracting trehalose, 

as well as the use thereof in a process of forced extraction of water from said plant or plant part. According to yet 
another embodiment of the invention a chimaeric plant expressible gene is provided, comprising in sequence a tran- 
scription initiation region obtainable from a gene, preferentially expressed in a plant part, particularly the patatm gene 
from Solanum tuberosum, a 5'-untranslated leader, an open reading frame encoding a trehalose phosphate synthase 

w activity and downstream of said open reading frame a transcriptional terminator region. 

According to yet another embodiment of the invention a chimaeric plant expressible gene is provided, comprising 
in sequence a transcription initiation region obtainable from a gene, preferentially expressed in a plant part, part.cularly 
the patatin gene from Solanum tuberosum, a 5'-untranslated leader, an open reading frame encoding a trehalase 
coupled in the antisense orientalion, and downstream of said open reading frame a transcriptional term.nator region. 

is A preferred plant expressible gene according to the invention is one wherein said transcript.onal terminator region is 
obtainable from the proteinase inhibitor-ll gene of Solanum tuberosum. The invention also provided vectors and re- 
combinant plant genomes comprising a chimaeric plant expressible gene according to the invention, as well as a p an 
cell having a recombinant genome, a plant or a part thereof, consisting essentially of cells. A further preferred plant 
species according to this aspect is Solanum tuberosum, and a micro-tuber thereof. 

zo The invention further provides a process for obtaining trehalose, comprising the steps of growing plant cells ac- 

cording to the invention or cultivating a plant according to the invention and extracting trehalose from sa.d plant cells, 

plants or parts. 

The following figures further illustrate the invention. 

25 DESCRIPTION OF THE FIGURES 

Figure 1 . Schematic representation of binary vector pMOG845. 
Fi gure 2. Schematic representation of multi-copy vector pMOG1192. 

Fi gure 3 Alignments for maximal amino acid similarities of neutral trehalase from S. cerevisiae with penplasmatic 
trehalase from E coll, small intestinal trehalase from rabbit and trehalase from pupal midgut of the s.lkworm, Bombyx 
mori Identical residues among all trehalase enzymes are indicated in bold italics typeface. Conserved regions of the 
amino acid sequences were aligned to give the best fit. Gap's in the amino acid sequence are represented by dashes. 
Positions of degenerated primers based on conserved amino acids are indicated by dashed arrows. 

Fiqure 4 Alignment for maximal amino acid similarity of trehalases derived from E. coli (Ecoli2treh ; Ecolrtreha). 
silkworm (Bommotreha). yellow mealworm (Tenmotreha), rabbit (Rabbitreha), Solanum tuberosum cv. Kardal (Pota- 
totreha) and S cerevisiae (Yeasttreha). Gap's in the amino acid sequence are represented by dots. 

Figure 5. Trehalase activity in leaf samples of Nicotiana tabacum cv. Samsun NN. Non-transgenic control plants 
are indicated by letters a-1 , plants transgenic for pMOG1078 are indicated by numbers. 

Figure 6 Trehalose accumulation in microtubers induced on stem segments derived from Solanum tubersosum 
■ cv Kardal plants transgenic for both pMOG 845 (patatin driven TPS E coli expression) and pMOG1027 (35SCaMV an- 
tisense-trehalase expression). N indicates the total number of transgenic lines screened. Experiments were performed 
in duplicate resulting in two values: a and b. ND: not determined. 

DETAILED DESCRIPTION OF THE INVENTION 

According to the present invention it has been found that the accumulation of an increased level of trehalose in 
plants and plant parts is feasible. This important finding can be exploited by adapting plant systems to produce and/ 
or accumulate high levels of trehalose at lower cost. 

According to one aspect of the invention the accumulation of increased levels of trehalose is achieved by mhibitmg 
so endogenous trehalases. I nhibition of trehalases can be performed basically in two ways: by administration of trehalase 
inhibitors exogenously, and by the production of trehafese inhibitors endogenously, for instance by transforming the 
plants with DNA sequences coding for trehalase inhibitors. 

This inhibition can be equally well applied to plants which are transformed with enzymes which enable the pro- 
duction of trehalose, but also to plants which are able to synthesize trehalose naturally. 
55 According to this first embodiment of the invention, trehalase inhibitors are administered to the plant system ex- 

ogenously Examples of trehalase inhibitors that may be used in such a process according to Ihe invention are trehazol.n 
produced in Micromonospora, strain SANK 62390 (Ando etai, 1991. J. Antibiot. 44. 1165-1168), validoxylamine A B. 
G D-g/uco-Dihydrovalidoxylamine A. L-rcto-Dihydrovalidoxylamin A. Deoxynojirimycin (Kameda etal., 1987, J. Ant.biot. 
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(Perils americana) (Hayakawa et a,. 1 9 8 So. Che^C SEfiSZ™ *" 

tors are administered to plant o^L Trenalasa inhibi- 
parts or cultures. Typically fte treha Si inhMo £ t? h« T' " * ^ 8Utable f ° f Uptake by the P |ants - P ,an « 
of acth,e ingredient pre erabVSween ar?iT,T° US SO ' lJti ° n ° f b8tWeen 100 nM and 10 mM 
spraying on .eaves, watering^d^ ^s by 

- vaNdarnycin is solaco.. a commercia.ly avai.able agricu.tural format ^5tn^'^ l ^ Uto ° f 

- caned -antisense approach " ^kn^ SE;?^ Th,S S ° 
disclosed in WO 95/01446). A and the Exa ™P'es to inhibit SPS 

acid ^^Tolll^ZTeT^Z SSSSSS JeS? °T librarV ^ S6qUenCed - The > rediCted «*» 

trena^g^ ? * ^r^ 9 3 W * h the 

degree of homology can a s £ usedThts the aSZ? k f " 96ne ' HOWeVer ' SeqU6nCeS *** have a 
wi„ he directed against the nuclide Z^Z^SSSgi X l^T *" U ™ M ™ °< 
that the potato trehalase sequence can also be used Z inhfb^ ttSSi* a ' S ° demonstra ted in this application 

is highly homo,ogous to the tomato ,reha,ase equen e Thuf « JZ^^lT*'^'™*™" 
least in closely related species but mavbe also in mhZ „tli tk env,sa 9 ed tnat the P°^° sequence is usable at 
enough to express only part oHhe h^otoaous ° P n Tn th , " thS ^ conside ™9 that it is usually 

of expression^ the Sl^^^ -•*>» ***** 

i^r™^ 

also exhibits trehalose hydrolvsino activitv Di s *r7i;*rYHo , 901 1904 '. 1992 >- The obtained protein preparation 
purification steps can be — 

oligonucleotide probes ^SdhS^I^SSZ^^ " T ^ SeqU6nCes enab,e the desi 9 n * 
the corresponding m RNAs 'using Z^'„E2S^^^ " «"«*«* * *°' a <* 

isolat^dTrSefsS" basea 0 onsewed sequences ^ jn 

prote^aTis^^^ °' 3 CaUSin9 * e 0^°" °< a 

purified from the serum cX£^Z££ ^o^ZpZZI^ °' ^ ^ hotatad "* 

protein, of which the sequence partly has S^SSSJl ? a T ! amer,ca ^ (Hayakawa et al., sup ra ;. This 
the gene coding for the protein fusion of Z oen to a llLhi PUb " Ca,, ° n - Can be made ^pressab.e by isolation of 
the plant according to standard' l^l^S^r^T ' ' ^ ,ranSf0rmafon of said 9 ene into 

l?trZ.ot r r ay be , SaleC,ed ,rorn an V Qene of driving transcription in plant cells 

If trehalose accumufcfon ,s only desired in certain plant parts, such as potato (mini-)«ubers. the trehalase inhibrtory 
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DN A construct (e a the antisense construct) comprises a promoter fragment that is preferentially expressed in (mini-) 
tuber T!^^La^ trehalase levels in the remainder of the plant's cells to be substantially unaffected^ 
^neSL ejects of trehalose to neighbouring plant cells due to trehalose diffusion, ,s counteracted by unaffected 
pndoaenous trehalase activity in the remainder of the plant. 

TZ Example illustrating the invention, wherein treha.ose phosphate synthase is produced under the control of 
the patatin promoter fragment, also the trehalase-inhibitory construct may comprise a promoter fragment of the patat.n 

960 Mutatis mutandis if trehalose is to be accumulated in tomato fruit, both a plant expressible trehalose phosphate 
synthase qene which is at least expressed in the tomato fruit is to be used, as well as a plant expressible treha ase- 
nh ibitory DN A construct, which should be expressed preferentially in the fruit, and preferably not or not substanfiaHy. 
outsideThe fruit An example of a promoter fragment that may be used to drive expression of DNA^onstructs prefer- 
entfaSy torn o frurt is disc.osed in EP 0 409 629 A1 . Numerous modificatbns of this aspect oHhe .nvenuon. tha do 
not depart from the scope of this invention, are readily envisaged by persons having ordinary sk.ll .n the art to wh.ch 
this mention P^ins^^ ^ ^ ^ ^ ^ such as caused by end tre . 

halase is the introduction into the genome of the plant host of an additional copy of said endogenous trehalase , gena 
It is often observed that the presence of a transgene copy of an endogenous gene silences the expression of both the 
endoqenous gene and the transgene (EP 0 465 572 A1 ). , Ko 
According to one embodiment of the invention accumulation of trehalose is brought about in plants wherein the 
capacity of producing trehalose has been introduced by introduction of a plant expressible gene construct encodmg 
trehalose phosphate synthase (TPS), see for instance WO 95/061 26. 

Any trehalose phosphate synthase gene under the contro. of regulatory elements necessary for expression^ of DN A 
in plant cells, either specifically or constitutively, may be used, as long as . I. capable of P"^.^ J**?* 
phosphate synthase activity Most preferred are the trehalose phosphate synthase genes which also harbour a coding 
sequence for trehalose phosphate phosphatase activity, the so called bipartite enzymes. Such a gene, formerly only 
known to exLt in yeast (see e.g. WO 93/17093), can also been found in most plants. This app .cation descnbes the 
Sda L of such a gene from the sunflower Helianthus annuus, whi.e also evidence is 9 K,en for-the ex.stence of a 
homoSus gene in Nicotians tabaoum. It is be.ieved that the use of a bipartite enzyme enhances th e produc t,on of 
trehalose because it enables completion of the metabolic pathway from UDP-glucose and glucose-6-phosphate .nto 
trehalose at one and the same site. Hence, the two-step synthesis is simplified into a one-step react.on. thereby ,n- 
creasinq reaction speed and, subsequently, trehalose yield. «,«,«, K i» * ««. 

As genes involved in trehalose synthesis, especially genes coding for bipartite enzymes, become available from 
other sources these can be used in a similar way to obtain a plant expressible trehalose synthes.zmg gene according 

^^Sourc^sfor isolating trehalose synthesizing activities include microorganisms (e.g. bacteria, yeast, fungi), but 
thP«;e aenes can also be found in plants and animals. m 

The inveSon also encompasses nucleic acid sequences which have been obtained by mooting the nude.c aod 
sequence encoding enzymes active in the synthesis of trehalose by mutating one or more codons so > tha j « esutts in 
amino acid changes in the encoded protein, as long as mutation of the amino ac,d sequence does not ent.rely abol.sh 

" "^W^*™**™* of the invents p.ants are genetically altered to produce and accumulate tre- 
halose in specific parts of the plant, which were selected on the basis of considerations such as substr ^ avai.ab.my 
for the enzvme insensitivity of the plant part to any putative adverse effects of trehalose on plant cell functioning and 
tne ^LTPe red site for trehalose synthesising enzyme expression are starch storage parts of p.ants. In part.cu.ar 
potato tubers are considered to be surtable plant parts. A preferred promoter to achieve selective enzyme expression 
in microtubers and tubers of potato is obtainable from the region upstream of the open reading frame of the patatm 

^pSKJSsl ssst *— ~ s ^ thase ° n,y may be ,urther mo r d * r?r 9 

additional genes that encode phosphatases that are capable of the conversion of trehalose phosphate into trehalose. 
A^tTn^taLube 

appears to be present, so that the introduction of a trehalose phosphate phosphatase (TPP) gene is not an absolute 
requirement.^ ^ ^ Spermatophyta are the Angiospermae, notably the Dicotyledoneae, comprising 

inter alia the Solanaceae as a representative family, and the Monocotyledoneae, comprising mteral* he Grammeae 
as a representative family. Suitable host plants, as defined in the context of the present invention .nclude .plant > (as 
well as parts and cells of said plants) and their progeny which have been generally modified using recomb.nant DMA 
techniques to cause or enhance production of trehalose in the desired plant or plant organ; these plants may be used 
directly (e g the plant species which produce edible parts) in processing or the trehalose may be extracted and/or 
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«M,sa. a* acm/nata,. b^J^^^^^!"2^ a h S app,e e * banana 

e.g. avftmjt cucumber fCuo,™* e g «JJ£? ? ^ 65 (SUCh 38 ,he sweet cherr V' P ^ 

nuts (such as the walnut. Jugtans eg rial XStl f ' ' em ° n ^ *"* me,on < Cucu ™ «■** 

a* Pe/s/ca;, pear fPyra eg e£J£IS^ T 9 * ^ AflL peach 

flfiWfa. e.g. masctete,, La^Sp^n To ZZ* ? TT* (Pn "**- ** d °™****). strawberry 
(such as Branca oleJea), JSSSSES? it S SUCh 88 ^ flMbdta ^ sa ^ cabba 9*s 

rsp/hacaoferaceaa;, tobacco f/SS teto^m! r^^i ^"'^ lettuce A**"* spinach 
carro, ^ -0^^^ «■ 
(Dioscorea esculenta), sweet potato f/pomoea batata) J Z! ' 0fass,ca / a PaA radish (Raphanus sativus), yam 
sa,,Vum,, soybean fG/yc/n J? A ^S^^^^Z!^ ""^ Pea 

saf,Va;, tubers, suchas kohlrabi (BtassiLoleiao^^^Z! ^ °° m (Zea rice f°0*a 

be conserved by drying in the presence of «n^?^ f So/a ™"» tuberosum), and the like. The edible parts may 
expresses trehalose ,herSin dUG l ° the 

presence of a plant 

The use of Agrobaclrium tu^facSTorZo^enZ ZoleZ * « " eXpreSSed in Said p,ant ce »- 

procedures are availabte for the ^uc^DnZoZ^Te^^ tra "«°" ■ Purred, but other 
the calcium/polyethylene glycol method elect*™,* £n rl • Examples are transformation of protoplasts using 

rykus, 1990, Bio/Technol. 8, ^T^^^Ta TT^ DNA <»™ P a ™* bombardment (Po, 
used. Alsotransformation^otocol invo^ COated bombardment may be 

(e.g. from the Cauliflower Mosaic Virus 2? 7 < ^S^^fT^■^ ta ^^~^ , ^ 
referredtoasagroinfection^^ 

the protoplasts, cells or plant parts that have been taL**™~T.I! 177 179) selection and/or screening, 
known in the art (Horsch et aL 1985, Science S iSq i^T) re9en6r8ted ,rt ° Wh °' e P ' ants " usin 9 
intrc^ 

formation of monocot species are t,Co^ ** trans- 

ment of explants or suspension cells and direct mtuZ7Tl t 9 mbactenum ^™^ ™croprojectile bombard- 
274-276). ^rotectem^mediated t^«^£ if^,- electroporahon (Shimamoto, etai, 1989, Nature 338 
Plants haveTeenobtain^C^^^ We "; n riCe (WO 94/00977 >- T ^9enic rniiFe 

acetyltransferase (an enzyme vvh^Mna^^^^ 

suspension culture by microprojectile bonSJ^ SS^rJ^^^ ,n, ° embr y o 9^ic cells of a ma*e 
ofgeneticmaterialintoaleuro^eprotoSs^ Plant Cell, 2, 603-618). The introduction 

1989, Plant Mol. Biol. 13, 21 -30) Wheat pTan ^^^^ ta ^ ^ ^ Wed (Lae, " 
lecting only the aged compact and ncJlr emb^n^ 9 , , embr y°9*™ suspension culture by se^ 
pension cultures (Vasi., 1 m Te^no? 8^29^) t,SSU6S establishme "t of the embryogenic sus- 

- - as trLtS^ (including marker genes), such 

thatis expressed inaplant cell Also intense ^ he '"^ be derived any gene 

r t™ e9 enesa?cordin 9 1«K^;C==S: 

9-^^^^^ 9 ene „ked to the plant expressible 

formats is we., within the scope of the aveX skiMed worker Z ° 8 M * 9ene in p,ant trans - 

the neomycin phosphotransferase gen^s 3rin g nt^l '^* US6d 9enes are 

ferasegenefromrat^erconferringListance^g,^ 

confernng upon overexpression resistance to glutamine svnthetase7nhrhrtlr; 5 f 3)> glutam, ne synthetase 
the acetyl transferase gene from Streptomy Js ^rTJoc ^rom^« ^ , 88 phos P hinothric '"" (WO87/05327), 

phinothricin (EP-A 275 957). the gene enco^gTS^^^^ f"?""? ? 8 ' StanCe * the Se,ective a 9en, phos- 

ance to N-phosphonomethy glycine the bTrcene con^n r«Ti phOSpha,e s y nthase ( Ep SPS) conferring toler- 
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Preferred plant material for transformation, especially for dicotyledonous crops are '^-discs which can be readily 
transformed and have good regenerative capability (Horsch R.B. et al., (1 985) Science £27 229-1 231). 

It is immaterial to the invention how the presence of two or more genes in the same plant is effected. This can mter 
alia done be achieved by one of the following methods: 

(a) transformation of the plant line with a multigene construct containing more than one gene to be introduced, 

(b) co-transforming different constructs to the same plant line simultaneously, 

(c) subsequent rounds of transformation of the same plant with the genes to be introduced, 

(d) crossing two plants each of which contains a different gene to be introduced into the same plant, or 
io (e) combinations thereof. 

The field of application of the invention lies both in agriculture and horticulture, for instance due to improved prop- 
erties of the modified plants as such (e.g. stress tolerance, such as cold tolerance, and preferably drought resistance, 
a^d nc easeTn post-harvest quality and shelf-life of plants and plant products), as well as in any form of .ndustjy where 
is trehalose is or will be applied in a process of forced water extraction, such as drying or freeze dry.ng. Trehalose can 
be useTor so d as such'for instance in purified form or in admixtures, or in the form of a plant product, such as a tuber 
a fruit, a flower containing the trehalose, either in native state or in (partially) dehydrated form, and the like. Plant parts 
harbouring (increased levels of) trehalose phosphate or trehalose may be used or sold as such or processed without 

Wso rehalose'calu.e extracted and/or purified from the plants or plant parts producing it and subsequently used 
in an industrial process. In the food industries trehalose can be employed by adding trehalose to foods before drying. 
Drying of foods is an important method of preservation. Trehalose seems especially useful to conserve food products 
Jough conventional air-drying, and to allow for fast reconstitution upon addition of water o a h.gh qual.ty produc 
(Rosere* a/, July1991, Trends in Food Science and Technology, pp. 166-169). The benefits include retention of natural 
flavors/fragrances, taste of fresh product, and nutritional value (proteins and vitamins). It has been shown that trehalose 
has the ability to stabilize proteins e.g. vaccines, enzymes and membranes, and to form a chemically inert stable 
glass. The low water activity of such thoroughly dried food products prevents chemtcal reactions, that could cause 

^'Sw crops like corn, cassava, potato, sugar beet and sugarcane have since long been used as a natural source 
for bulk carbohydrate production (starches and sucrose). The production of trehalose in such crops, faclrtated by 
genetic engineering of the trehalose-biosynthetic pathway into these plant species, would allow the exploitation of such 
enaineered crops for trehalose production. 

Trehalose is also used in drying or storage of biological macromolecules, such as peptides, enzymes, polynucle- 

° tld AI^ in this specification are indicative of the level of skill in the art to which the invention pertains^ 

All publications, whether patents or otherwise, referred to previously or later in this specification are herein incorporated 
by Terence as if each of them was individual^ incorporated by reference. In particular WO 95/01 446, cited herein, 
describing the production of trehalose in higher plants by genetic manipulation is herein incorporated by ^rence^ 
The Examples given below illustrate the invention and are in no way intended to indicate the limits of the scope 
40 of the invention. 

Experimental 

DNA manipulations 

All DNA procedures (DNA isolation from E.coli, restriction, ligation, transformation, etc.) are performed according 
to standard protocols (Sambrook eta/. (1989) Molecular Cloning: a laboratory manual, 2nd ed. Cold Spring Harbor 
Laboratory Press, CSH, New York). 

50 Strains 

In all examples E coli K-12 strain DH5a is used for cloning. The Agrobacterium tumefaciens strains used for plant 
transformation experiments are EH A 105 and MOG 101 (Hood et al 1993, Trans. Research 2, 208-218) 

ss Isolation of a patatin promoter/construction of oMOG546 

A patatin promoter fragment is isolated from chromosomal DNA of Sotanum tuberosum^. Bintje using polymer- 
ase chain reaction. A set of oligonucleotides, complementary to the sequence of the upstream region of the *pat21 
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5' AAG CTT ATG TTG CCA TAT AGA GTA G 3' PatB33.2 (SEQIDNO-3, 
5' GTA GTT GCC ATG GTG CAA ATG TTC 3' Pat ATG. 2 (SBQIDNO:4) 

Coned using EcoR, .inkers Jo a P^SS^^^^^^ ^ and is 

Construction of pMOQ 7QQ 

incorporated harem by .atatance. Ascribed ,n dalatl In Intatrallonal patent application WO 9&01«S. 

Construction of p MOG845 

(Hindlll) PstI Kpnl Hindlll 

5' AGCT CTGCAG TGA GGTACC A 3' TCV 11 (SEQIDNO.S) 

3' GACGTC ACT CCATGG TTCGA 5' TCV 12 (SEQIDNO:*) 

Pill) (An, G., Mitra, A.. Choi, H K Costa M A An i< T^hn^nh □ Proteinase inhibitor II terminator (Pot- 

115-122). The PotPil, terminator isTs^ed by ' P C R e^^uL r^ ,° * ° 989) ^ Plant C °" 1: 
Desiree as a template and the following set «^SSS£ 9 Chr ° mOSOmal DNA ***d from potato cv 



5' GTACCCTGCAGTGTGACCCTAGAC 3' tc V 15 

45 5 ' TCGATTCATAGAAGCTTAGAT 3 » TCV 16 



(SEQIDNO:7) 
(SEQIDNO: 8) 



SO 



ss 



^K^CSSi" a EcoRI-Hind,,, fragment into the binary vector pMOG402 
Bureau voor Schimmelcu.tures SSS^S'SSJ'SS'S ^ d9p °-" ad « 4,16 Cent ^ 

the Accession Number given by the .ntem^naToe^C.n^n ifcTs nT"*"* " ^ * 1 " 5; 
Triparental matinqs 

<**a> M*. .,„,„ MOGtOI 0, EHA.05 « SSS^ $A "' 7M7 """> ^«"»- 
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Transformation of tobacco (Nicotiana tabacum SR1 ) 
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Tobacco is transformed by cocultivation of planttissue with Agrobacterium_tumefaciens strain MOG1 01 containing 
the binary vector of interest as described. Transformation is carried out using cocultivation of tobacco (Nicotiana tab- 
acumSRDIeaf disks as described by Horsch et a/. 1 985, Science 227, 1229-1231. Transgenic plants are regenerated 
from shoots that grow on selection medium containing kanamycin, rooted and transferred to soil. 

Transformation of potato tuber discs 

Potato (Solanum tuberosum cv. Kardal) is transformed with the Agrobacterium strain EH A 105 containing the 
binary vector of interest. The basic culture medium is MS30R3 medium consisting of MS salts (Murashige, T and 
Skooq F (1 962) Physiol Plan. 14, 473), R3 vitamins (Ooms et a/. (1 987) Theor. Appl. Genet. 73, 744), 30 g/l sucrose, 
0 5 q/l i MES with final pH 5 8 (adjusted with KOH) solidified when necessary with 8 g/l Daichin agar. Tubers of Solanum 
tuberosum cv. Kardal are peeled and surface sterilized by burning them in 96% ethanol for 5 seconds. 

Extinguish the flames in sterile water and cut slices of approximately 2 mm thickness. Disks are cut with a bore 
from the vascular tissue and incubated for 20 minutes in MS30R3 medium containing 1-5 xlC* bactena/ml of Agro- 
bacterium EHA 105 containing the binary vector. Wash the tuber discs with MS30R3 medium and transfer them to 
solidified postculture medium (PM). PM consists of M30R3 medium supplemented with 3.5 mg/l zeatm riboside and 
0 03 mg/l indole acetic acid (IAA). Aftertwo days, discs were transferred to fresh PM medium with 200 mg^ ce otax.m 
and 100 mg/l vancomycin. Three days later, the tuber discs are transferred to shoot induction medium (SIM) which 
consists of PM medium with 250 mg/l carbenicillin and 100 mg/l kanamycin. After 4-8 weeks, shoots emergmg from 
the discs are excised and placed on rooting medium (MS30R3-medium with 100 mg/l cefotax.m, 50 mg/l vancomycin 
and 50 mg/l kanamycin). The shoots are propagated axenically by meristem cuttings. 

25 Potato stem-seament transformation protocol. 

Potato transformation experiments using stem-internodes were performed in a similar way as described by Newell 
C.A. et at., Plant Cell Reports 10: 30-34, 1990. 

30 Induction of micro-tubers 

Stem segments of in vitro potato plants harbouring an auxiliary meristem are transferred to micro-tube r inducing 
medium Micro-tuber inducing medium contains 1 X MS-salts supplemented with R3 vitamins, 0.5 g/l MES (final pH= 
5.8, adjusted with KOH) and solidified with 8 g/l Daishin agar, 60 g/l sucrose and 2.5 mg/l kinetin. After 3 to 5 weeks 
35 of growth in the dark at 24 0 C, micro-tubers are formed. 

Trehalose assay 

Trehalose was determined quantitatively by anion exchange chromatography with pulsed amperometric detection 
Extracts were prepared by adding 1 ml boiling water to 1 g frozen material which was subsequently heated for 15 at 
1 00°C Samples (25 pi) were analyzed on a Dionex DX-300 liquid chromatograph equipped with a 4 x 250 mm Dionex 
35391 "carbopac PA-1 column and a 4 x 50 mm Dionex 43096 carbopac PA-1 precolumn. Elution was with 100 mM 
NaOH at 1 ml/min. Sugars were detected with a pulsed amperometric detector (Dionex, PAD-2). Commercially available 
trehalose (Sigma) was used as a standard. 
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Isolation of Validamvcin A 

Validamycin A is isolated from Solacol, a commercial agricultural formulation (Takeda Chem. Indust., Tokyo) as 
described by Kendall etal. (1 990) Phytochemistry. Vol. 29, No. 8, pp. 2525-2528. The procedure involves ion exchange 
so chromatography (QAE-Sephadex A-25 (Pharmacia), bed vol. 10 ml, equilibration buffer 0.2 mM Na-Pi pH 7) from a 
3% agricultural formulation of Sofacol. Loading 1 ml of Sotacol on the column and eluting with water in 7 fractions, 
practically all Validamycin is recovered in fraction 4. 

Based on a 100% recovery, using this procedure, the concentration of Validamycin A was adjusted to 110 M in MS- 
buffer, for use in trehalose accumulation tests. 
ss Alternatives Validamycin A and B may be purified directly from Streptomyces hygroscopicus var. hmoneus as de- 
scribed by Iwasa T. etal, 1971 , in The Journal of Antibiotics 24(2), 11 9-123, the content of which is incorporated herein 
by reference. 
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Construction nf pMQG!027 

-1^^^ «J - -rsed orientation under 

construction of plviOG799 and can be ShTZSi^T*" ° f thiS VeC '° r iS Very similar t0 the 
by triparental mating to AQn£*£mS^^ 

Plants having reduced levels of trehafie actS ' C6 " S ^ ,0 9enefa,e tranS9enic 

Construction of DMOG109R 

co^™ in «he reversed orientation under 

PMOG845 and can be performed by JTSir ^Wlled L T !" Simi ' ar l ° the ^ruction of 
mating to AgrobacterL, this strain can be used in X^Xr^Z^ ^ °' ™* V6Ct ° r by triparental 
having reduced fevels of trehalase activity* TubeJ-tissua ,ranS, ° rma,IOn ex P<™ts to generate transgenc plants 

Construction of pMOG 1078 

o ri en T :rr r h c e = 

sequently used for cloning in the btna" lector ° pen ™ d >»9 f ra ™ ™s fragment was sub- 

PMOG1078. pMOGSOO is derived iromoMOGJ^T ? > ! COnst,,u,,ve de35S CaMV promoter yielding 
derived of P MOG23 (described" ^V^T2^^^?^! P, ^ hk ? r hM r6St ° red pMOG «* is 
sky R.L., Fine M.. Pe.low J.W., Proc Nat. Acl^cT^A^ ^SS^'SST" Ph ° Sph ° transferase (Yenof- 

EXAMPLE 1 
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Trehalose production in toh^ cco plants transformed with pMnnyoo 
selfing ( S1 ). seeds of these transgenic plantsTre ^ 

Kanamycin resistant seedlings and wil Mw ^tobacco nbn ^ , , 9ermina,ed '" medium with Kanamycin. 

Validamycin A. As a control, ?^sgen k ? JSElSSJill T\ f™* *° MS " medium supplemented with 1 & M 
A. Analysis of leaves and roots of pints orown on Sin ^ P A P T transferred to m «dium without Xfelidamycin 





with Validamycin A 


without Validamycin A 


leaf 


roots 


leaf 


roots 


PMOG799.1 
PMOG799.13 
PMOG799.31 
| Wild-type SR1 


0.0081 
0.0110 
0.0008 


0.0044 
0.0080 
0.0088 




0.003 



so EXAMPLE 2 



. Trehalose production in potato m icro-tubers transformed with P Mnr,«^ 
bourlnXt" 

-™,idam^^ 
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Validamycin A (Table 2). No trehalose was detected in wild-type m-tubers. 

Table 2. 
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Trehalose (% fresh weight) 


+Validamycin A 


-Validamycin A 


845-2 


0.016 




845-4 






845-8 


0.051 




845-1 3 


0.005 




845-22 


0.121 




845-25 


0.002 




wT Kardal 







EXAMPLE 3 

Trehalose production in hvdrocultures of tobacco plants tra nsformed with pMOG799 

Seeds (S1) of selfed tobacco plants transformed with the binary vector pMOG799 are surface sterilised and ger- 
minated in vitro on MS20MS medium containing 50 jig/ml Kanamycin. Kanamycin resistant seedlings are transferred 
to soil and grown in a growth chamber (temp. 23°C, 1 6 hours of light/day). After four weeks, seedlings were transferred 
to hydrocultures with ASEF clay beads with approximately 450 ml of medium. The medium contains 40 g/l Solacol 
dissolved in nano-water buffered with 0.5 g/l MES to adjust to pH 6.0 which is sieved through a filter to remove solid 
particles Essential salts are supplemented by adding POKONTM (1.5 ml/l). The following antibiotics are added to 
prevent growth of micro-organisms: 500u-g/ml Carbenicillin, 40uxj/ml Nystatin and 100jig/ml Vancomycin. As a control, 
transgenic seedlings and wild-type plants are transferred to medium without Solacol. Analysis of leaves of plants grown 
on Solacol shows elevated levels of trehalose compared to the control plants (Table 3). No trehalose was detected in 
wild-type tobacco plants. 

Table 3 



Solacol 


Trehalose (%w/w) 


pMOG 799.1-1 


+ 


0.008 


pMOG 799.1-2 


+ 


0.004 


pMOG 799.1-3 






pMOG 799.1-4 






pMOG 799.1-5 


+ 


0.008 


pMOG 799.1-6 






pMOG 799.1-7 


+ 


0.005 


pMOG 799.1-8 






pMOG 799.1-9 






pMOG 799.1-10 




0.007 


Wild-type SR1-1 






Wild-type SR1-2 


+ 




Wild-type SR1-3 






Wild-type SR1-4 


+ 





Example 4 

Cloning of a full length cDN A encoding trehalase from potato tuber 

Using the amino acid sequence of the conserved regions of known trehalase genes (E.coli, Yeast, Rabbit, B. mori) 
(fig. 3), four degenerated primers were designed: 
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C C C CGT GT A TTAT 
GG T GGI A G A TT TAA IGA AG T C TA CGGC TGGGAC TaSe24 <SEQIDNO: 11) 

5 

GTICCIGGIGGICGITT^IGA^T Tase 25 (SEQIDNO:12, 

10 T GA TG A A 

GGIGG cT G1 ct ICGI ca 1AG g TA g TA Tase26 (S EQIDNO:13) 

C G AT A 

1S IqVTI^CCATC^AAICCITC T ase27 (SEQIDNO:14) 

EXAMPLE 5 

Homology between the trehalase oene from potato with other Solans** 

cv P G etit SanTSli^JTSf ^^^T* 0 " eSCU,entum c * Moneymaker), tobacco (Nicotianatabacum 
EXAMPLE 6 

Overexpression of a potato i rehalase cDNA in Nicotiana 

Tobacco leaf discs are transformed with the binary vector pMOG1078 usinq Aarobacterium t „ mo , a ^ c T 
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the identity of the potato trehalase cDNA. 
EXAMPLE 7 

5 Transformation of PMOG845 transgenic po tato plants with pMOG1027 

In order to super-transform pMOG845 transgenic potato lines with an anti-sense trehalase construct (pMOG1 027), 
stemsegmentswerecutfrom in vitrocultured P otatoshootstransgenicfor P MOG845. Three parent lines were selected, 
DMOG845/11 /22 and /28 that revealed to accumulate trehalose in microtubers when grown on vahdamycm A. The 
stem segments were transformed with the binary vector pMOG1027 using Agrobacterium tumefacens. Supertrans- 
formants were selected on Hygromycin and grown in vitro. 



10 



EXAMPLE 8 

,5 Trehalose production in tubers of potato plants transgenic for PMOG845 and pMOG1027 

Microtubers were induced on explants of the pMOG845 transgenic potato plants supertransformed with 
PMOG1027 using medium without the trehalase inhibitor validamycin A. The accumulation of trehalose, up to 0.75 mg 
fresh weight was noted in the supertransformed lines proving the reduced trehalase activrty ,n these l.nes us.ng 
the anti-sense trehalase expression strategy (Fig. 6). 
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EXAMPLE 9 

Isolation of a bipartite TPSAPP gene from Helianthus annuus 

To isolate a bipartite clone from H. annuus, a PCR amplification experiment was set up using two degenerate 
primers TPS<feg2 and TPS-deg5. This primerset was used in combination with cDNA constructed on H. annuus leaf 
RNA as a template. A DNA fragment of approximately 650 bp. was amplified having a high similarity on am.no acid 
level when compared to tps coding regions from E. coli and yeast. Based on its nucleot.de sequence, homologous 
SeTs were designed and used in a Marathon RACE protocol (C.ontech) to isolate the 5' and 3' parts , of corresponding 
tps cDNA's. Using Tprimercombinations SUNGSP1 (or 2VAP1 in RACE PCR, no bands ; were observed ^whereas nested 
PCR with NSUNGSP1(or2)/AP2 resulted in several DNA fragments. Some of these fragments hybridized w.th a 32P 
labelled Sunflower tps ragment after Southern blotting. Two fragments of circa 1 .2 kb and 1 .7 kb. corresponds re- 
pectely to the 5" and 3' 'part, were isolated from gel, subcloned and sequenced. The .nucleotide 
a clear homology with known tps and tpp.sequences indicating the b.partrte nature of the .solated cDNA (SEQ ID NO 
1) Using a unique Xmal site present in both fragments, a complete TPS/TPP b.part.te cod.ng region was obta.ned and 
subcloned in pGEM-T (Promega) yielding pMOG1192 (Fig. 2). 
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TPSdeg2: 


tig 


git 


kit tyy tic aya yic 


cit tyc c 


(SEQIDNO: 


23) 


TPSdegS: 


gyi 


aci 


arr 


ttc ati 


ccr 


tci 


c 


(SEQIDNO: 


27) 


SUNGSP1: 


cga 


aac 


ggg 


ccc ate 


aat 


ta 




(SEQIDNO: 


15) 


SUNGSP2 : 


teg 


atg 


aga 


tea atg 


ccg 


ag 




(SEQIDNO: 


16) 


API (Clontech) : 


cca 


tec 


taa 


tac gac 


tea 


eta 


tag ggc 


(SEQIDNO: 


17) 


NSUNGSPl: 


cac 


aac 


agg 


ctg gta 


tec 


eg 




(SEQIDNO: 


18) 


NSUNGSP2 : 


caa 


taa 


cga 


act ggg 


aag 


cc 




(SEQIDNO: 


19) 


AP2 (Clontech) : 


act 


cac 


tat 


agg get 


cga 


gcg 


gc 


(SEQIDNO: 


20) 
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EXAMPLE 1Q 



isolation of a bipartite TPSfTPP r no f Njgotjana rTftinm . 



TPSdegl : 
TRE-TPP-16: 
TPSdeg2 : 
TRE-TPP-15 : 
TRE-TPP-10 : 
TRE-TPP-6: 



GAY ITI ATI TGG RTI CAY GAY TAY CA 

CCI ACI GTR CAI GCR AAI AC 

TIG GIT KIT TYY TIC AYA YIC CIT TYC C 

TGR TCI ARI ARY TCY TTI GC 

CCR TGY TCI GCI SWI ARI CC 

TCR TCI GTR AAR TCR TCI CC 



(SEQIDNO: 21) 
(SEQIDNO: 22) 
(SEQIDNO: 23) 
(SEQIDNO: 24) 
(SEQIDNO: 25) 
(SEQIDNO: 26) 
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SEQUENCE LISTING 



5 (1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MOGEN INTERNATIONAL NV 

(B) STREET: Einsteinweg 97 
10 (C) CITY: Leiden 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP) : 2233 CB 

(G) TELEPHONE : (31) 71-5258282 

(H) TELEFAX: (31) 71-5221471 



15 



(ii) TITLE OF INVENTION: Enhanced accumulation of trehalose in plants 
(iii) NUMBER OF SEQUENCES: 27 



25 



30 



20 <iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 

40 ■ 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 25.. 2485 

(D) OTHER INFORMATION: /function= "trehalose phosph. 
45 synthase and trehalose phosph. phosphatase" 

/product^ "bipartite enzyme" 

(ix) FEATURE: 

(A) NAME /KEY: unsure 
so (B) LOCATION: 1609.. 1611 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

51 



CTGATCCTGC GGTTTCATCA CAAT ATG ATA CTC TTA CAT CTG ATG CCC CTT 

Met lie Leu Leu His Leu Met Pro Leu 
1 5 



CAG ATG CTC CCA AAT AGG TTG ATT GTC GTA TCG AAT CAG TTA CCC ATA 99 
Gin Met Leu Pro Asn Arg Leu He Val Val Ser Asn Gin Leu Pro He 
10 10 15 20 25 



ATC GCT AGG CTA AGA CTA ACG ACA ATG GAG GGT CCT TTT GGG ATT TCA 
He Ala Arg Leu Arg Leu Thr Thr Met Glu Gly Pro Phe Gly lie Ser 



30 35 40 



CTT GGG ACG AGA GTT CGA TTT ACA TGC ACA TCA AAG ATG CAT TAC CCG 
Leu Gly Thr Arg Val Arg Phe Thr Cys Thr Ser Lys Met His Tyr Pro 
45 5 0 



55 



GCT TAT GTT CAC GTG AAC AAA GAG TTT TCC CAG AAG GTG ATG GAG GCA 
Ala Tyr yal His Val Asn Lys Glu Phe Ser Gin Lys Val Met Glu Ala 
140 145 150 

JS C f T CGT AGC AAT TAT GTA TGG ATA CAT GAC TAC CAT TTA ATG 
Val Thr Asn Arg Ser Asn Tyr Val Trp He His Asp Tyr His Leu Met 
155 160 165 

ACG CTA CCG ACT TTC TTG AGG CGG GAT TTT TGT CGT TTT AAA ATC GGT 
Thr Leu Pro Thr Phe Leu Arg Arg Asp Phe Cys Arg Phe Lys He Gly 

175 180 * 185 

III Phi ?* J 60 TTT CCT TCC TCG GAG GTT TAC AAG ACC CTA 

Phe Phe Leu Hxs Ser Pro Phe Pro Ser Ser Glu Val Tyr Lys Thr Leu 

55 190 195 200 



147 



195 



CAG CCG TTG AGG TTT TCT ATT CTT GGC GAT CCA CTA AGG GCT GAC GTT 243 
Gin Pro Leu Arg Phe Ser He Leu Gly Asp Pro Leu Arg Ala Asp Val 
60 65 70 

GGC CCT ACC GAA CAA GAT GAC GTG TCA AAG ACA TTG CTC GAT AGG TTT 291 
Gly Pro Thr Glu Gin Asp Asp Val Ser Lys Thr Leu Leu Asp Arg Phe 
75 80 85 

AAT TGC GTT GCG GTT TTT GTC CCT ACT TCA AAA TGG GAC CAA TAT TAT 339 
30 **" CYS W1 Ala Val Phe Val Pro Thr Ser Lys Trp Asp Gin Tyr Tyr 

95 100 105 

CAC TGC TTT TGT AAG CAG TAT TTG TGG CCG ATA TTT CAT TAC AAG GTT 387 
His Cys Phe Cys Lys Gin Tyr Leu Trp Pro He Phe His Tyr Lys Val 
3S 110 H5 120 

CCC GCT TCT GAC GTC AAG AGT GTC CCG AAT AGT CGG GAT TCA TGG AAC 435 
Pro Ala Ser Asp Val Lys Ser Val Pro Asn Ser Arg Asp Ser Trp Asn 
125 130 135 



483 



531 



579 



627 
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CCA ATG AGA AAC GAG CTC TTG AAG GGT CTG TTA AAT GCT GAT CTT ATC 
Pro Met Arg Asn Glu Leu Leu Lys Gly Leu Leu Asn Ala Asp Leu He 
205 210 2 

GGG TTC CAT ACA TAG GAT TAT GCC CGT CAT TTT CTA ACG TGT TGT ACT 
Gly Phe His Thr Tyr Asp Tyr Ala Arg His Phe Leu Thr Cys Cys Ser 
220 225 230 

CGA ATG TTT GGT TTG GAT CAT CAG TTG AAA AGG GGG TAC ATT TTC TTG 
Arg Met Phe Gly Leu Asp His Gin Leu Lys Arg Gly Tyr He Phe Leu 
235 240 2A5 

GAA TAT AAT GGA AGG AGC ATT GAG ATC AAG ATA AAG GCG AGC GGG ATT 
Glu Tyr Asn Gly Arg Ser He Glu He Lys lie Lys Ala Ser Gly 
250 255 260 

CAT GTT GGT CGA ATG GAG TCG TAC TTG AGT CAG CCC GAT ACA AGA TTA 
Stl Sal Gly Arg Met Glu Ser Tyr Leu Ser Gin Pro Asp Thr Arg Leu 

20 2 7 0 2 7 5 

CAA GTT CAA GAA GTC CAA AAA CGT TCG AAG GAA ATC GTG CTA CTG GGA 
S£n SIT Gin Glu val Gin Lys Arg Ser Lys Glu He Val Leu Leu Gly 
285 2 9° 295 

GTT GAT GAT TTG GAT ATA TTC AAA GGT GTG AAC TTC AAG GTT TTA GCG 
71 Asp Asp Leu Asp He Phe Lys Gly val Asn Phe Lys val Leu Ala 
300 305 310 

TTG GAG AAG TTA CTT AAA TCA CAC CCG AGT TGG CAA GGG CGT GTG GAA 
leu Glu Lys Leu Leu Lys Ser His Pro Ser Trp Gin Gly Arg Val Glu 
315 320 3" 

AAG GTG CAA ATC TTG AAT CCT CTG CGC CGT TGC CAA GAC GTC GAT GAG 
Lys Val Gin He Leu Asn Pro Leu Arg Arg Cys Gin Asp Val Asp Glu 
330 335 340 345 

ATC AAT GCC GAG ATA AGA ACA GTC TGT GAA AGA ATC AAT AAC GAA CTG 
xlt Asn Ala Glu He Arg Thr Val Cys Glu Arg He Asn Asn Glu Leu 
40 350 355 360 

GGA AGC CCG GGA TAC CAG CCC GTT GTG TTA ATT GAT GGG CCC GTT TCG 
Gly Ser Pro Gly Tyr Gin Pro val Val Leu He Asp Gly Pro Val Ser 
365 370 375 

TTA AGT GAA AAA GCT GCT TAT TAT GCT ATC GCC GAT ATG GCA ATT GTT 
Leu Ser Glu Lys Ala Ala Tyr Tyr Ala He Ala Asp Met Ala lie Val 
380 385 390 

ACA CCG TTA CGT GAC GGA CTG AAT CTT ATC CCG TAC GAG TAC GTC GTT 
Thr Pro Leu Arg Asp Gly Leu Asn Leu He Pro Tyr Glu Tyr Val Val 
395 400 405 
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723 



771 



819 



867 



915 



963 



1011 



1059 



1107 



1155 



1203 



1251 



55 



SDOCID <EP 078409SA2_I_> 



17 



10 



20 



30 



35 



EP 0 784 095 A2 

TCC CGA CAA ACT GTT AAT GAC CCA AAT CCC AAT ACT CCA AAA AAG AGC 

Ser Arg Gin Ser Val Asn Asp Pro Asn Pro Asn Thr Pro Lys Lys Ser 

° 415 «0 425 

ATG CTA GTG GTC TCC GAG TTC ATC GGT GTT TCA CTA TCT TTA ACC GGG 
Met Leu val Val Ser Glu Phe He Gly Val Ser Leu Ser "i Jhr §S 
430 435 4 4o 

GCC ATA CGG GTC AAC CCA TGG GAT GAG TTG GAG ACA GCA GAA GCA TTA 1395 
Ala He Arg Val Asn Pro Trp Asp Glu Leu Glu Thr Ala Glu Ala Leu 
445 450 455 



TAC GAC GCA CTC ATG GCT CCT GAT GAC CAT AAA GAA ACC GCC CAC ATG 
Tyr Asp Ala Leu Met Ala Pro Asp Asp His Lys Glu Thr Ala His Met 
460 465 470 

AAA CAG TAT CAA TAC ATT ATC TCC CAT GAT GTA GCT AAC TGG GCT AGC 
Lys Gin Tyr Gin Tyr He lie Ser His Asp Val Ala Asn Trp All sTr 
* 75 480 485 

TTC TTT CAA GAT TTA GAG CAA GCG TGC ATC GAT CAT TCT CGT AAA CGA 
Phe Phe Gin Asp Leu Glu Gin Ala Cys He Asp His Ser Arg Lys Arg 
490 495 500 50 | 

TGC ATG AAT TTA GGA TTT GGG TTA GAT ACT AGA GTC GTC TTT TTG ATG 
Cys Met Asn Leu Gly Phe Gly Leu Asp Thr Arg Val Val Phe Leu Met 
510 515 520 

AGA ACT TTA GCA AGT TGG ATA AAG ATG TCT TGG AAG AAT GCT TAT TCC 
Arg Ser Leu Ala Ser Trp He Lys Met Ser Trp Lys Asn Sa lyl Ser 
525 530 



535 



nil S Sf* f T ^ MA CTT TTG GAC ™ GAC 600 ACT GTT ACT 

Met Ala Gin Asn Arg Ala lie Leu Leu Asp Tyr Asp Gly Thr Val Thr 

540 545 550 



AAA CTG TGC AAT GAT CCA AAG AAC ATG GTG TTC ATC GTT AGT GGA CGC 
Lys Leu Cys Asn Asp Pro Lys Asn Met val Phe He Val Ser Gly Arg 

575 580 585 

AGT AGA GAG AAA ATC TTG GCA GTT GGT TCG GCG CGT GTG AGA ACC CGC 
Ser Arg Glu Lys lie Leu Ala Val Gly Ser Ala Arg S3 £g J£ lf g 
50 590 595 600 

CAT TGC ACT GAG CAC GGA TAC TTT ATA AGG TGG GCG GGT GAT CAA GAA 
Hxs Cys Thr Glu His Gly Tyr Phe He Arg Trp Ala Gly A^p £n" Glu 
605 610 



1299 



1347 



1443 



1491 



1539 



1587 



1635 



1683 



CCA TCT ATC AGT AAA TCT CCA ACT GAA GCT GTT ATC TCC ATG ATC AAC 17,1 
40 Pro Ser He Ser Lys Ser Pro Thr Glu Ala Val He Ser £t J£ ^ 
555 560 565 



1779 



1827 



1875 

Gin Glu 

615 
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TGG GAA ACG TGC GCA CGT GAG AAT AAT GTC GGG TGG ATG GAT GGA AAT 1923 
Trp Glu Thr Cys Ala Arg Glu Asn Asn Val Gly Trp Met Asp Gly Asn 
620 625 630 

CTG AGG CCG GTT ATG AAT CTT TAT ACA GAA ACT ACT GAC GGT TCG TAT 1971 
Leu Arg Pro Val Met Asn Leu Tyr Thr Glu Thr Thr Asp Gly Ser Tyr 
635 640 645 

ATT GAA AAG AAA GAA ACT GCA ATG GTT TGG CAC TAT GAA GAT GCT GAT 2019 
lie Glu Lys Lys Glu Thr Ala Met Val Trp His Tyr Glu Asp Ala Asp 
650 655 660 665 

AAA GAT CTT GGG TTG GAG CAG GCT AAG GAA CTG TTG GAC CAT CTT GAA 2067 
is Lys Asp Leu Gly Leu Glu Gin Ala Lys Glu Leu Leu Asp His Leu Glu 

670 675 680 

AAC GTG CTC GCT AAT GAG CCC GTT GGA GTG AAT CGA ACA GGT CAA TAC 2115 
Asn Val Leu Ala Asn Glu Pro Val Gly Val Asn Arg Thr Gly Gin Tyr 
20 685 690 695 

ATT GTA GAA GTT AAA CCA CAG TCC CCC ATT AAT TAC CTT CTT GTT ATG 2163 
lie Val Glu Val Lys Pro Gin Ser Pro lie Asn Tyr Leu Leu Val Met 
700 " 705 710 

25 

ACA TTC ATA GGC ACT GAT TGT AGA ATC TTT AAC TTA AAT TTC TTT AAA 2211 
Thr Phe lie Gly Thr Asp Cys Arg He Phe Asn Leu Asn Phe Phe Lys 
715 720 725 

TAT GAA TGC AAT TAT AGG GGG TCA CTA AAA GGT ATA GTT GCA GAG AAG 2259 
Tyr Glu Cys Asn Tyr Arg Gly Ser Leu Lys Gly He Val Ala Glu Lys 
730 735 740 745 

ATT TTT GCG TTC ATG GCT AAA AAG GGA AAA CAG GCT GAT TTC GTG TTG 2307 
He Phe Ala Phe Met Ala Lys Lys Gly Lys Gin Ala Asp Phe Val Leu 
750 755 760 

ACG TTG AAT GAT AGA AGT GAT GAA GAC ATG TTT GTG GCC ATT GGG GAT 2355 
Thr Leu Asn Asp Arg Ser Asp Glu Asp Met Phe Val Ala He Gly Asp 
40 765 770 775 

GGA ATA AAA AAG GGT CGG ATA ACT AAC AAC AAT TCA GTG TTT ACA TGC 2403 
Gly He Lys Lys Gly Arg He Thr Asn Asn Asn Ser Val Phe Thr Cys 
780 785 790 



30 



35 



45 



SO 



55 



GTA GTG GGA GAG AAA CCG AGT GCA GCT GAG TAC TTT TTA AAT GAT GTC 2451 
Val Val Gly Glu Lys Pro Ser Ala Ala Glu Tyr Phe Leu Asn Asp Val 
795 800 805 

TCG AGA AGC TCC GGG TGT CTC AGC AAC CAA GGA T GATCCGGAAG 2495 
Ser Arg Ser Ser Gly Cys Leu Ser Asn Gin Gly 
810 815 820 

CTTCTCGTGA TCTTTATGAG TTAAAAGTTT TCGACTTTTT CTTCATCAAG ATTCATGGGA 2555 
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AAGTTGTTCA ATATGAACTT GTGTTCTTGG TTCTGGATTT TAGGGAGTCT ATGGATATAA 2615 
CATTTC 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
io (A) LENGTH: 820 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met He Leu Leu His Leu Met Pro Leu Gin Met Leu Pro Asn Arg Leu 
1 5 10 15 

He Val Val Ser Asn Gin Leu Pro He He Ala Arg Leu Arg Leu Thr 
20 25 30 

Thr Met Glu Gly Pro Phe Gly He Ser Leu Gly Thr Arg Val Arg Phe 
25 3 5 4 0 45 

Thr Cys Thr Ser Lys Met His Tyr Pro Gin Pro Leu Arg Phe Ser He 
50 55 60 



Leu Gly Asp Pro Leu Arg Ala Asp Val Gly Pro Thr Glu Gin Asp Asp 
65 70 75 80 

Val Ser Lys Thr Leu Leu Asp Arg Phe Asn Cys Val Ala Val Phe Val 
85 90 95 

Pro Thr Ser Lys Trp Asp Gin Tyr Tyr His Cys Phe Cys Lys Gin Tyr 

105 HO 

Leu Trp Pro He Phe His Tyr Lys Val Pro Ala Ser Asp Val Lys Ser 
"5 120 125 

Val Pro Asn Ser Arg Asp Ser Trp Asn Ala Tyr Val His Val Asn Lys 
130 135 140 

Glu Phe Ser Gin Lys Val Met Glu Ala Val Thr Asn Arg Ser Asn Tyr 
145 "0 155 iso 

Val Trp He His Asp Tyr His Leu Met Thr Leu Pro Thr Phe Leu Arg 
so 170 175 

Arg Asp Phe Cys Arg Phe Lys He Gly Phe Phe Leu His Ser Pro Phe 
180 185 190 

55 Pro Ser Ser Glu Val Tyr Lys Thr Leu Pro Met Arg Asn Glu Leu Leu 

195 200 205 



2621 
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Lys Gly Leu Leu Asn Ala Asp Leu lie Gly Phe His Thr Tyr Asp Tyr 
210 215 220 

Ala Arg His Phe Leu Thr Cys Cys Ser Arg Met Phe Gly Leu Asp His 
225 230 235 240 

Gin Leu Lys Arg Gly Tyr lie Phe Leu Glu Tyr Asn Gly Arg Ser He 
245 250 255 

Glu lie Lys He Lys Ala Ser Gly He His Val Gly Arg Met Glu Ser 
260 265 270 

,s Tyr Leu Ser Gin Pro Asp Thr Arg Leu Gin Val Gin Glu Val Gin Lys 
275 280 285 

Arg Ser Lys Glu He Val Leu Leu Gly Val Asp Asp Leu Asp He Phe 
290 295 300 

Lys Gly Val Asn Phe Lys Val Leu Ala Leu Glu Lys Leu Leu Lys Ser 
305 310 315 320 

His Pro Ser Trp Gin Gly Arg Val Glu Lys Val Gin He Leu Asn Pro 

25 325 330 335 

Leu Arg Arg Cys Gin Asp val Asp Glu He Asn Ala Glu He Arg Thr 
340 345 350 



30 



3S 



40 



val Cys Glu Arg He Asn Asn Glu Leu Gly Ser Pro Gly Tyr Gin Pro 
355 360 365 

Val Val Leu He Asp Gly Pro Val Ser Leu Ser Glu Lys Ala Ala Tyr 
370 375 380 

Tyr Ala He Ala Asp Met Ala He Val Thr Pro Leu Arg Asp Gly Leu 
385 390 395 400 

Asn Leu He Pro Tyr Glu Tyr Val Val Ser Arg Gin Ser Val Asn Asp 
405 410 415 

Pro Asn Pro Asn Thr Pro Lys Lys Ser Met Leu Val Val Ser Glu Phe 
420 425 430 

45 He Gly Val Ser Leu Ser Leu Thr Gly Ala He Arg val Asn Pro Trp 
435 440 445 

Asp Glu Leu Glu Thr Ala Glu Ala Leu Tyr Asp Ala Leu Met Ala Pro 
450 455 460 

so 

Asp Asp His Lys Glu Thr Ala His Met Lys Gin Tyr Gin Tyr He He 
465 470 475 480 

Ser His Asp Val Ala Asn Trp Ala Ser Phe Phe Gin Asp Leu Glu Gin 
55 485 490 495 
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Ala Cys lie Asp His Ser Arg Lys Arg Cys Met Asn Leu Gly Phe Glv 
500 505 510 

5 

Leu Asp Thr Arg Val Val Phe Leu Met Arg Ser Leu Ala Ser Trp il e 
515 520 525 

Lys Met Ser Trp Lys Asn Ala Tyr Ser Met Ala Gin Asn Arg Ala lie 
10 530 535 540 

Leu Leu Asp Tyr Asp Gly Thr Val Thr Pro Ser lie Ser Lys Ser Pro 
545 550 555 560 

75 Thr Glu Ala Val lie Ser Met Ile Asn Lys Leu Cys Asn Asp Pro Lys 

565 570 575 

Asn Met Val Phe Ile Val Ser Gly Arg Ser Arg Glu Lys lie Leu Ala 
580 585 590 

Val Gly Ser Ala Arg Val Arg Thr Arg His Cys Thr Glu His Gly Tyr 
595 600 605 

Phe lie Arg Trp Ala Gly Asp Gin Glu Trp Glu Thr Cys Ala Arg Glu 
€10 615 620 

Asn Asn Val Gly Trp Met Asp Gly Asn Leu Arg Pro Val Met Asn Leu 
625 630 635 640 

Tyr Thr Glu Thr Thr Asp Gly Ser Tyr Ile Glu Lys Lys Glu Thr Ala 
fi45 650 655 

Met Val Trp His Tyr Glu Asp Ala Asp Lys Asp Leu Gly Leu Glu Gin 
660 665 670 

Ala Lys Glu Leu Leu Asp His Leu Glu Asn Val Leu Ala Asn Glu Pro 
675 680 6B5 

Val Gly Val Asn Arg Thr Gly Gin Tyr lie Val Glu Val Lys Pro Gin 
690 695 700 

Ser Pro Ile Asn Tyr Leu Leu Val Met Thr Phe lie Gly Thr Asp Cys 
705 710 715 720 

Arg He Phe Asn Leu Asn Phe Phe Lys Tyr Glu Cys Asn Tyr Arg Glv 
725 730 ~ 735 

Ser Leu Lys Gly Ile Val Ala Glu Lys Ile Phe Ala Phe Met Ala Lys 
so 740 745 750 

Lys Gly Lys Gin Ala Asp Phe Val Leu Thr Leu Asn Asp Arg Ser Asp 
755 760 765 

ss Glu Asp Met Phe Val Ala Ile Gly Asp Gly Ile Lys Lys Gly Arg He 
770 775 780 



25 



30 



35 



40 
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Tht Asn Asn Asn Ser Val Phe Thr Cys Val Val Gly Glu Lys Pro Ser 
785 790 795 800 

Ala Ala Glu Tyr Phe Leu Asn Asp Val Ser Arc, Ser Ser Gly Cys Leu 
805 810 815 



Ser Asn Gin Gly 
820 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AAGCTTATGT TGCCATATAG AGTAG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTAGTTGCCA TGGTGCAAAT GTTC 
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15 



20 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGCTCTGCAG TGAGGTACCA 
(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
2S (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



35 <«i> SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GACGTCACTC CATGGTTCGA 
(2) INFORMATION FOR SEQ ID NO: 7: 

40 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
4i> <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTACCCTGCA GTGTGACCCT AGAC 
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(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS : 
5 <A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TCGATTCATA GAAGCTTAGA T 
20 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2207 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mKNA 

30 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 

(B) STRAIN: Kardal 

(ix) FEATURE: 
40 (A) NAME /KEY : CDS 

(B) LOCATION: 161.. 1906 

(ix) FEATURE: 

<A) NAME /KEY: misc_f eature 
45 (B) LOCATION: 842.. 850 

(D) OTHER INFORMATION: /function" "putative 
glycosylationsite" 



so < x i) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTTTTCTGAG TAATAACATA GGCATTGATT TTTTTTCAAT TAATAACACC TGCAAACATT 
CCCATTGCCG GCATTCTCTG TTCTTACAAA AAAAAACATT TTTTTGTTCA CATAAATTAG 

55 
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TTATGGCATC AGTATTGAAC CCTTTAACTT GTTATACAAT ATG GGT AAA GCT ATA 175 

Met Gly Lys Ala lie 
1 5 

ATT TTT ATG ATT TTT ACT ATG TCT ATG AAT ATG ATT AAA GCT GAA ACT 223 
He Phe Met He Phe Thr Met Ser Met Asn Met He Lys Ala Glu Thr 
10 15 20 

TGC AAA TCC ATT GAT AAG GGT CCT GTA ATC CCA ACA ACC CCT TTA GTG 271 
Cys Lys Ser He Asp Lys Gly Pro Val He Pro Thr Thr Pro Leu Val 
25 30 35 

ATT TTT CTT GAA AAA GTT CAA GAA GCT GCT CTT CAA ACT TAT GGC CAT 31 9 

He Phe Leu Glu Lys Val Gin Glu Ala Ala Leu Gin Thr Tyr Gly His 
40 45 50 

AAA GGG TTT GAT GCT AAA CTG TTT GTT GAT ATG TCA CTG AGA GAG AGT 367 
Lys Gly Phe Asp Ala Lys Leu Phe Val Asp Met Ser Leu Arg Glu Ser 
55 60 65 



CTT TCA GAA ACA GTT GAA GCT TTT AAT AAG CTT CCA AGA GTT GTG AAT 415 
Leu Ser Glu Thr Val Glu Ala Phe Asn Lys Leu Pro Arg val Val Asn 
70 75 80 " 85 

GGT TCA ATA TCA AAA AGT GAT TTG GAT GGT TTT ATA GGT AGT TAC TTG 4 63 

Gly Ser lie Ser Lys Ser Asp Leu Asp Gly Phe He Gly Ser Tyr Leu 
90 95 * 100 



AGT AGT CCT GAT AAG GAT TTG GTT TAT GTT GAG CCT ATG GAT TTT GTG 511 
Ser Ser Pro Asp Lys Asp Leu Val Tyr Val Glu Pro Met Asp Phe Val 
105 HO 115 

3S GCT GAG CCT GAA GGC TTT TTG CCA AAG GTG AAG AAT TCT GAG GTG AGG 559 
Ala Glu Pro Glu Gly Phe Leu Pro Lys Val Lys Asn Ser Glu Val Arg 
"0 125 130 



GCA TGG GCA TTG GAG GTG CAT TCA CTT TGG AAG AAT TTA AGT AGG AAA 
Ala Trp Ala Leu Glu Val His Ser Leu Trp Lys Asn Leu Ser Arg Lys 
135 140 145 



607 



GTG GCT GAT CAT GTA TTG GAA AAA CCA GAG TTG TAT ACT TTG CTT CCA 655 
Val Ala Asp His Val Leu Glu Lys Pro Glu Leu Tyr Thr Leu Leu Pro 
45 1 50 !55 160 165 

TTG AAA AAT CCA GTT ATT ATA CCG GGA TCG CGT TTT AAG GAG GTT TAT 703 
Leu Lys Asn Pro Val He He Pro Gly Ser Arg Phe Lys Glu Val Tyr 
170 175 180 

TAT TGG GAT TCT TAT TGG GTA ATA AGG GGT TTG TTA GCA AGC AAA ATG 751 
Tyr Trp Asp Ser Tyr Trp Val He Arg Gly Leu Leu Ala Ser Lys Met 
185 190 195 



ss 
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TAT GAA ACT GCA AAA GGG ATT GTG ACT AAT CTG GTT TCT CTG ATA GAT 7 99 

Tyr Glu Thr Ala Lys Gly lie Val Thr Asn Leu Val Ser Leu lie Asp 
200 205 210 

CAA TTT GGT TAT GTT CTT AAC GGT GCA AGA GCA TAC TAC AGT AAC AGA 847 
Gin Phe Gly Tyr Val Leu Asn Gly Ala Arg Ala Tyr Tyr Ser Asn Arg 
215 220 225 

AGT CAG CCT CCT GTC CTG GCC ACG ATG ATT GTT GAC ATA TTC AAT CAG 895 
Ser Gin Pro Pro Val Leu Ala Thr Met lie Val Asp He Phe Asn Gin 
230 235 240 245 

ACA GGT GAT TTA AAT TTG GTT AGA AGA TCC CTT CCT GCT TTG CTC AAG 943 
Thr Gly Asp Leu Asn Leu Val Arg Arg Ser Leu Pro Ala Leu Leu Lys 
250 255 260 

GAG AAT CAT TTT TGG AAT TCA GGA ATA CAT AAG GTG ACT ATT CAA GAT 991 
Glu Asn His Phe Trp Asn Ser Gly lie His Lys Val Thr lie Gin Asp 
265 270 275 

GCT CAG GGA TCA AAC CAC AGC TTG AGT CGG TAC TAT GCT ATG TGG AAT 1039 
Ala Gin Gly Ser Asn His Ser Leu Ser Arg Tyr Tyr Ala Met Trp Asn 
280 285 290 

AAG CCC CGT CCA GAA TCG TCA ACT ATA GAC AGT GAA ACA GCT TCC GTA 1087 
Lys Pro Arg Pro Glu Ser Ser Thr He Asp Ser Glu Thr Ala Ser Val 
295 300 305 

CTC CCA AAT ATA TGT GAA AAA AGA GAA TTA TAC CGT GAA CTG GCA TCA 1135 
Leu Pro Asn lie Cys Glu Lys Arg Glu Leu Tyr Arg Glu Leu Ala Ser 
310 315 320 325 

GCT GCT GAA AGT GGA TGG GAT TTC AGT TCA AGA TGG ATG AGC AAC GGA 1183 
Ala Ala Glu Ser Gly Trp Asp Phe Ser Ser Arg Trp Met Ser Asn Gly 
330 335 340 



TCT GAT CTG ACA ACA ACT AGT ACA ACA TCA ATT CTA CCA GTT GAT TTG 1231 
40 . Ser Asp Leu Thr Thr Thr Ser Thr Thr Ser He Leu Pro Val Asp Leu 
345 350 355 

AAT GCA TTC CTT CTG AAG ATG GAA CTT GAC ATT GCC TTT CTA GCA AAT 1279 
Asn Ala Phe Leu Leu Lys Met Glu Leu Asp He Ala Phe Leu Ala Asn 
45 360 365 370 

CTT GTT GGA GAA AGT AGC ACG GCT TCA CAT TTT ACA GAA GCT GCT CAA 1327 
Leu Val Gly Glu Ser Ser Thr Ala Ser His Phe Thr Glu Ala Ala Gin 
375 380 385 



AAT AGA CAG AAG GCT ATA AAC TGT ATC TTT TGG AAC GCA GAG ATG GGG 1375 
Asn Arg Gin Lys Ala He Asn Cys He Phe Trp Asn Ala Glu Met Gly 
390 395 400 405 



55 
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CAA TGG CTT GAT TAC TGG CTT ACC AAC AGC GAC 
Gin Trp Leu Asp Tyr Trp Leu Thr Asn Ser Asp 
410 415 

TAT AAA TGG GAA GAT TTG CAC CAG AAC AAG AAG 
Tyr Lys Trp Glu Asp Leu His Gin Asn Lys Lys 
425 430 

TTT GTT CCG CTG TGG ACT GAA ATT TCT TGT TCA 
Phe Val Pro Leu Trp Thr Glu lie Ser Cys Ser 
440 445 

ACT CAG AAA GTA GTT CAA AGT CTC ATG AGC TCG 
Thr Gin Lys Val Val Gin Ser Leu Met Ser Ser 
455 460 

GCA GGG ATT GCA ATG ACC TTG TCT AAT ACT GGA 
Ala Gly lie Ala Met Thr Leu Ser Asn Thr Gly 
470 475 480 

CCG AAT GGT TGG CCC CCC CTT CAA CAC ATA ATC 
Pro Asn Gly Trp Pro Pro Leu Gin His lie He 
490 495 

AGG TCT GGA CTA GAA GAG GCA AGA ACC TTA GCA 
Arg Ser Gly Leu Glu Glu Ala Arg Thr Leu Ala 
505 510 

CGC TGG TTA AGA ACT AAC TAT GTG ACT TAC AAG 
Arg Trp Leu Arg Thr Asn Tyr Val Thr Tyr Lys 
520 525 

TAT GAA AAA TAT GAT GTC ACA AAA TGT GGA GCA 
Tyr Glu Lys Tyr Asp Val Thr Lys Cys Gly Ala 
535 540 

GAA TAT ATG TCC CAA ACG GGT TTC GGA TGG TCA 
Glu Tyr Met Ser Gin Thr Gly Phe Gly Trp Ser 
550 555 560 



ACA TCT GAG GAT ATT 
Thr Ser Glu Asp He 
420 

TCA TTT GCC TCT AAT 
Ser Phe Ala Ser Asn 
435 

GAT AAT AAT ATC ACA 
Asp Asn Asn He Thr 
450 

GGC TTG CTT CAG CCT 
Gly Leu Leu Gin Pro 
465 

CAG CAA TGG GAT TTT 
Gin Gin Trp Asp Phe 
485 

ATT GAA GGT CTC TTA 
lie Glu Gly Leu Leu 
500 

AAA GAC ATT GCT ATT 
Lys Asp lie Ala lie 
515 

AAA ACC GGT GCT ATG 
Lys Thr Gly Ala Met 
530 

TAT GGA GGT GGT GGT 
Tyr Gly Gly Gly Gly 
545 

AAT GGC GTT GTA CTG 
Asn Gly Val Val Leu 
565 



GCA CTT CTA GAG GAA TTT GGA TGG CCT GAA GAT 
Ala Leu Leu Glu Glu Phe Gly Trp Pro Glu Asp 
570 * 575 

TAATGAGCAA GTAGAAAAGC CAAATGAAAC ATCATTGAGT 

AAATAAGCTG C AATGGT TTG CTGATAGTTT ATGTTTTGTA 

TGTACCATAT CAAGTGATAT TACCATGAAC TATGTCGTTC 

TGCAAAAATA ATGCAGTTTT GGAGAATCCG ATAACATAGA 

TAAACAGCTT ACTATATTAA GTAAAAGAAA GATGATTCCT 



TTG AAG ATT GAT TGC 
Leu Lys He Asp Cys 
580 

TTTATTTTCT TCTTTTGTTA 
TTACTATTTC ATAAGGTTTT 
GGACTCT TCA AATCGGATTT 
CCATGTATGG ATCTAAATTG 
CTGCTTTAAA AAAAAAAAAA 



1423 

1471 

1519 

1567 

1615 

1663 

1711 

1759 

1807 

1855 

1903 

1963 
2023 
2083 
2143 
2203 
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5 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 
w (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

75 

Met Gly Lys Ala He He Phe Met He Phe Thr Met Ser Met Asn Met 
1 5 10 15 

He Lys Ala Glu Thr Cys Lys Ser He Asp Lys Gly Pro Val He Pro 

20 20 25 30 

Thr Thr Pro Leu Val He Phe Leu Glu Lys Val Gin Glu Ala Ala Leu 
35 «»> 45 

25 Gin Thr Tyr Gly His Lys Gly Phe Asp Ala Lys Leu Phe Val Asp Met 
50 " 55 «0 

Ser Leu Arg Glu Ser Leu Ser Glu Thr Val Glu Ala Phe Asn Lys Leu 
65 70 75 80 

Pro Arg Val Val Asn Gly Ser He Ser Lys Ser Asp Leu Asp Gly Phe 
85 90 95 

He Gly Ser Tyr Leu Ser Ser Pro Asp Lys Asp Leu val Tyr Val Glu 
35 100 105 HO 

Pro Met Asp Phe Val Ala Glu Pro Glu Gly Phe Leu Pro Lys Val Lys 
115 120 125 



2207 



40 



Asn Ser Glu Val Arg Ala Trp Ala Leu Glu Val His Ser Leu Trp Lys 
130 135 140 



Asn Leu Ser Arg Lys val Ala Asp His Val Leu Glu Lys Pro Glu Leu 
45 145 150 155 160 

Tyr Thr Leu Leu Pro Leu Lys Asn Pro Val He He Pro Gly Ser Arg 
165 170 175 

so Phe Lys Glu Val Tyr Tyr Trp Asp Ser Tyr Trp Val He Arg Gly Leu 
180 185 190 

Leu Ala Ser Lys Met Tyr Glu Thr Ala Lys Gly He Val Thr Asn Leu 
195 200 205 

55 
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Val Ser Leu He Asp Gin Phe Gly Tyr Val Leu Asn Gly Ala Arg Ala 
210 215 220 

Tyr Tyr Ser Asn Arg Ser Gin Pro Pro Val Leu Ala Thr Met lie Val 
225 2 30 235 " 240 

Asp He Phe Asn Gin Thr Gly Asp Leu Asn Leu Val Arg Arg Ser Leu 
245 250 255 

Pro Ala Leu Leu Lys Glu Asn His Phe Trp Asn Ser Gly lie His Lvs 
2 «0 265 2 70 * 

Val Thr He Gin Asp Ala Gin Gly Ser Asn His Ser Leu Ser Arg Tvr 
275 280 285 

Tyr Ala Met Trp Asn Lys Pro Arg Pro Glu Ser Ser Thr He Asp Ser 
290 295 300 

Glu Thr Ala Ser Val Leu Pro Asn He Cys Glu Lys Arg Glu Leu Tyr 
305 310 315 320 

Arg Glu Leu Ala Ser Ala Ala Glu Ser Gly Trp Asp Phe Ser Ser Ara 
325 330 335 

Trp Met Ser Asn Gly Ser Asp Leu Thr Thr Thr Ser Thr Thr Ser He 
340 345 350 

Leu Pro Val Asp Leu Asn Ala Phe Leu Leu Lys Met Glu Leu Asp He 
355 360 3 6 5 

Ala Phe Leu Ala Asn Leu Val Gly Glu Ser Ser Thr Ala Ser His Phe 
370 375 3 8 o 

Thr Glu Ala Ala Gin Asn Arg Gin Lys Ala He Asn Cys He Phe Trp 
385 390 395 400 

Asn Ala Glu Met Gly Gin Trp Leu Asp Tyr Trp Leu Thr Asn Ser Asp 
405 410 415 

Thr Ser Glu Asp He Tyr Lys Trp Glu Asp Leu His Gin Asn Lys Lys 
420 425 430 

Ser Phe Ala Ser Asn Phe Val Pro Leu Trp Thr Glu He Ser Cys Ser 
43 5 440 445 

Asp Asn Asn He Thr Thr Gin Lys Vai Val Gin Ser Leu Met Ser Ser 
450 455 4 6 o 

Gly Leu Leu Gin Pro Ala Gly He Ala Met Thr Leu Ser Asn Thr Gly 
465 47 ° 475 480 

Gin Gin Trp Asp Phe Pro Asn Gly Trp Pro Pro Leu Gin His He He 
485 490 495 
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He Glu Gly Leu Leu Arg Ser Gly Leu Glu Glu Ala Arg Thr Leu Ala 



500 



505 510 



Lys Asp He Ala He Arg Trp Leu Arg Thr Asn Tyr Val Thr Tyr Lys 
515 520 525 

Lys Thr Gly Ala Met Tyr Glu Lys Tyr Asp Val Thr Lys Cys Gly Ala 
530 535 540 

Tyr Gly Gly Gly Gly Glu Tyr Met Ser Gin Thr Gly Phe Gly Trp Ser 
545 550 555 560 

Asn Gly Val Val Leu Ala Leu Leu Glu Glu Phe Gly Trp Pro Glu Asp 
565 570 575 

Leu Lys He Asp Cys 
580 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
25 (3) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 
35 (A) NAME /KEY : modif ied_base 

(B) LOCATION: 6 

<D) OTHER INFORMATION: /mod_base= i 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGYGGNMGMT TYRWNGARKT MTAYKRYTGG GAC 33 
so (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /modJ>ase= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: S 

(D) OTHER INFORMATION: /modJbase= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ie djbase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /modjbase- i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME / KEY : modif iedjbase 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /modjbase- i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod base- i 



40 < xi > SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GTNCCNGGNG GNCGNTTYRW NGARKT 
(2) INFORMATION FOR SEQ ID NO: 13: 

4S 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 
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(ix) FEATURE: 

(A) NAME / KEY : modif iedjbase 

(B) LOCATION: 3 

5 (D) OTHER INFORMATION: /mod_base«= i 



(ix) FEATURE: 

(A) NAME /KEY : modif iedjbase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: t 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGNGGYTGNS WNCGNYRNAG RTARTA 



(2) INFORMATION FOR SEQ ID NO: 14: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



45 

(ix) FEATURE: 

(A) NAME /KEY : modif iedjbase 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /modjbase- i 

50 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod_base= i 
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(ix) FEATURE: 

<A> NAME/KEY: modif ied_base 
(B) LOCATION: 19 

(D) OTHER INFORMATION: /mod_base« 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /mod base= 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

NSCRTTNRYC CATCCRAANC CNTC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B> TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGAAACGGGC CCATCAATTA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TCGATGAGAT CAATGCCGAG 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
CCATCCTAAT ACGACTCACT ATAGGGC 
<2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18 
CACAACAGGC TGGTATCCCG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAATAACGAA CTGGGAAGCC 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ACTCACTATA GGGCTCGAGC GGC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



( ix) FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /modjDase- i 

(ix) FEATURE: 

(A) NAME/KEY; modif iedjbase 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /modbase* i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /modjbase- i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod_base= i 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAYNTNATNT GGRTNCAYGA YTAYCA 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



FEATURE : 
(A) NAME /KEY : modif ied_base 
(B> LOCATION: 3 

<D) OTHER INFORMATION: /mod_base= i 

FEATURE : 
<A) NAME /KEY: modif ied_base 
<B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base« i 

FEATURE : 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

40 

CCNACNGTRC ANGCRAANAC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) 

20 



(ix) 



25 



(ix) 

30 



(ix) 

35 



45 



50 
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(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /modJ>ase= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/ KEY: modif ied_base 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /modJbase= i 

(ix) FEATURE: 

(A) NAME /KEY: modif iedjbase 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE ; 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 23 

(D) OTHER INFORMATION: /mod base- i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TNGGNTKNTT YYTNCAYAYN CCNTTYCC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: modif iedjaase 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 
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(ix) 



FEATURE : 
(A) NAME /KEY : modi fie debase 
<B) LOCATION: 9 

(D) OTHER INFORMATION: /mod base- i 



5 



(ix) 



FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod base- i 



w 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



(2) INFORMATION FOR SEQ ID NO: 25: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME /KEY : modif iedjbase 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod base- i 



(ix) FEATURE: 

(A) NAME/KEY: modif iedjoase 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod base«= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



15 



TGRTCNARNA RYTCYTTNGC 



45 



SO 



55 



CCRTGYTCNG CNSWNARNCC 
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(2) INFORMATION FOR SEQ ID NO : 26: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 



5 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) 



MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 
<B) LOCATION: 17 

(D) OTHER INFORMATION: /mod base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCRTCNGTRA ARTCRTCNCC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 22 base pairs 
(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 
(ix) FEATURE: 



(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod__base«= i 
(ix) FEATURE: 



2B 



40 



SO 



(A) NAME /KEY: modified base 



55 



(A) NAME/KEY: modif ied_base 

(B) LOCATION: 15 
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(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/ KEY : modif iedjbase 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GYNACNARRT TCATNCCRTC NC 22 



Claims 

1. A process for producing trehalose in plant cells capable of producing trehalase by growing plant cells having the 
genetic information required for the production of trehalose and trehalase, or cultivating a plant or a part thereof 
comprising such plant cells, characterised in that said plant cells are grown, or said plant or a part thereof, is 
cultivated in the presence of a trehalase inhibitor 

2. A process according to claim 1, wherein said plant cells have been genetically altered so as to contain a gene 
25 coding for a bipartite trehalose synthesizing enzyme in a plant expressible form. 

3. A process according to claim 1 , wherein said plant cells have been genetically altered so as to contain a chimeric 
trehalose phosphate synthase gene in a plant expressible form, preferably wherein the trehalose phosphate syn- 
thase gene comprises an open reading frame encoding trehalose phosphate synthase from £. coli in plant ex- 

30 pressible form, more preferably wherein the open reading frame encoding trehalose phosphate synthase from E. 

coli is downstream of the CaMV 35S RNA promoter or the potato patatin promoter. 



4. A process according any of claim 1 to 3, wherein a Solanum tuberosum plant is cultivated, preferably wherein said 
plant has micro-tubers. 

5. A process according to claim 4, wherein said plant is cultivated in vitro. 



6. A process according to any one of claims 1 to 5, wherein said trehalase inhibitor comprises validamycin A in a 
form suitable for uptake by said plant cells, said plant, or a part thereof, preferably wherein the concentration of 

40 validamycin A is between 100 nM and 10 mM, more preferably between 0.1 and 1 mM, in aqueous solution. 

7. A process according to any one of claims 1 to 5, wherein said trehalase inhibitor comprises the 86kD protein of 
the cockroach (Pehpianeta americana) in a form suitable for uptake by said plant cells, said plant, or a part thereof. 

4 $ 8. A process according to any one of claims 1 to 5, wherein said plant cells have been genetically altered to contain 
the genetic information for a trehalase inhibitor, preferably wherein the trehalase inhibitor is the antisense gene to 
the gene encoding the information for trehalase or wherein the trehalase inhibitor is the 86kD protein of the Amer- 
ican cockroach (Periplaneta americana). 

50 9. A process according to any one of claims 1 to 8, wherein a plant, or a part thereof, accumulates trehalose in an 
amount above 0.01 % (fresh weight). 

10. A plant, or a part thereof, or plant cells, obtainable by a process according to any one of the claims 1 to 9, which 
contain trehalose in an amount above 0.01% (fresh weight), preferably wherein said plant, or a part thereof is a 

55 Solanaceae species, more preferably Solanum tuberosum or Nicotiana tabacum. 

11. A plant part according to claim 10, which is a tuber or a micro-tuber. 
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12. Tuber or micro-tubers of Solanum tuberosum containing trehalose. 

13. Use of a plant, or plant part, according to claim 10 for extracting trehalose. 

s 14. Use of a plant, or plant part, according to claim 10 in a process of forced extraction of water from said plant or 
plant part. 

15. A plant according to claim 10, which has an increased stress tolerance, preferably increased drought tolerance. 

io 16. A chimaeric plant expressible gene comprising in sequence a transcription initiation region obtainable from a gene 
preferentially expressed in a plant part, particularly the patatin gene from Solanum tuberosum, a 5'-untranslated 
leader, an open reading frame encoding a trehalose phosphate synthase activity, and downstream of said open 
reading frame a transciptional terminator region, preferably wherein said transcriptional terminator region is ob- 
tainable from the proteinase inhibitor-ll gene of Solanum tuberosum. 

17. A plant derived and plant expressible gene encoding a bipartite trehalose synthesizing enzyme. 

18. A vector comprising a chimaeric plant expressible gene according to claim 16 or 17. 

19. A recombinant plant genome comprising a chimaeric gene according to claim 18. 

20. A plant cell having a recombinant genome according to claim 18. 

21. A plant or a part thereof, consisting essentially of cells according to claim 20, preferably a plant from the species 
25 Solanum tuberosum. 

22. A plant part according to claim 21 , which is a tuber or a micro-tuber. 

23. A process for obtaining trehalose, comprising the steps of growing plant cells according to claim 20 or cultivating 
30 a plant according to claim 21, or cultivating a plant part according to any one of claims 21 or 22, extracting trehalose 

from said plant cells, plants or Darts. 



15 



20 



from said plant cells, plants or parts. 

24. A process for obtaining trehalose, comprising the steps of producing trehalose in plant cells, a plant or a part 
thereof, according to a process of any one of claims 1 to 9, and separating or extracting trehalose from said plant 
35 cells, plant or part thereof. 



40 • 



45 



50 
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Fig. 2 
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CpffiPflrifiOn Qf trehalase sequences derived from diff e rent-. ftp**fl if?fii 



1 50 

Ecoll2treh MLNQKIQN PNPDELMIEV DLCYELDPVTE 

Ecolitreha MKSPAPS RPQKMALIPA CIFLCFAALS 

Bomnvotreha MRLF LLLVGLTTV. 

Tenmotreha M1PF LLMVAFADTV 

Rabbitreha MPGSTWELHL LLiiLGLG. . . 

Potatotreha MGKAIIFM IPMSMNMIK 

Yeasttreha MSVFDNVSPF KKTGFGKLQQ TRRGSEDDTV SSSQGNRRFF IEDVDKTLNE 

51 100 

Ecoli2treh LKLDEMIEAE PEPQ4IBGLP ASDALTPADR YL ELFEH 

Ecolitreha VQAEETFVTP QPPDILLG. . PLFND 

Bonmotreha . . IADDLPPT CIRPVY . C NS I . ] !tLUHY 

Tenmotreha LQVSAQSQPS CDSKVY C QG KLLHV 

Rabbitreha LGSEQALPPP CESQIY C HG ELLHQ 

Potatotreha AETCKSIDKG PVIPTT p lv IFLEK 

Yeasttreha LLAAEDTOKN YQITIEDTGP KVLKVGTANS YGYKHINIRG TYMLSNLLQE 



Ecoli2treh 

Ecolitreha 

Bommotreha 

Tenmotreha 

Rabbitreha 

Potatotreha 

Yeasttreha 



Ecoli2treh 

Ecolitreha 

Bomnvotreha 

Tenmotreha 
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